Overview
Dataset statistics
| Number of variables | 15 |
|---|---|
| Number of observations | 29531 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 8.2 MiB |
| Average record size in memory | 292.8 B |
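As a rough consistency check, the average record size multiplied by the number of observations should reproduce the total in-memory size. A minimal sketch, assuming the report uses binary units (1 MiB = 1,048,576 bytes) and that the 8.2 MiB figure is rounded to one decimal:

```python
# Figures copied from the dataset statistics table above.
n_observations = 29531
avg_record_bytes = 292.8      # reported average record size in bytes
reported_total_mib = 8.2      # reported total size, rounded to one decimal

MIB = 1024 ** 2  # assumption: the report uses binary mebibytes

# Reconstruct the total size from the per-record average.
total_mib = n_observations * avg_record_bytes / MIB
print(f"reconstructed total: {total_mib:.2f} MiB")

# The reconstruction should match the reported value once rounded.
assert round(total_mib, 1) == reported_total_mib
```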
Variable types
| Categorical | 2 |
|---|---|
| DateTime | 1 |
| Numeric | 12 |
Alerts
| Alert | Type |
|---|---|
| AQI is highly overall correlated with AQI_Bucket and 2 other fields | High correlation |
| AQI_Bucket is highly overall correlated with AQI | High correlation |
| Benzene is highly overall correlated with Toluene | High correlation |
| NO is highly overall correlated with NOx | High correlation |
| NO2 is highly overall correlated with NOx | High correlation |
| NOx is highly overall correlated with NO and 1 other field | High correlation |
| PM10 is highly overall correlated with AQI and 1 other field | High correlation |
| PM2.5 is highly overall correlated with AQI and 1 other field | High correlation |
| Toluene is highly overall correlated with Benzene | High correlation |
| Benzene is highly skewed (γ1 = 23.63338029) | Skewed |
| NOx has 740 (2.5%) zeros | Zeros |
| CO has 2328 (7.9%) zeros | Zeros |
| Benzene has 3802 (12.9%) zeros | Zeros |
| Toluene has 2861 (9.7%) zeros | Zeros |
Reproduction
| Analysis started | 2025-06-02 04:38:30.783839 |
|---|---|
| Analysis finished | 2025-06-02 04:38:50.658073 |
| Duration | 19.87 seconds |
| Software version | ydata-profiling v4.16.1 |
| Download configuration | config.json |
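A report of this kind can be regenerated with ydata-profiling's `ProfileReport` class. The sketch below is illustrative only: the function name `build_report`, the report title, and the input path are hypothetical, and the default configuration is used rather than the `config.json` referenced above, so the output may differ in detail.

```python
def build_report(csv_path: str, out_path: str = "report.html") -> None:
    """Profile a CSV file and write an HTML report to out_path."""
    # Imported lazily so this module loads even where the libraries are absent.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Parse the Date column so it is profiled as DateTime rather than text.
    df = pd.read_csv(csv_path, parse_dates=["Date"])

    # Title is a placeholder; default settings stand in for the original config.
    profile = ProfileReport(df, title="Air quality profile")
    profile.to_file(out_path)


# Example call (hypothetical file name, not taken from the report):
# build_report("air_quality.csv")
```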
Variables
City
Categorical
| Distinct | 26 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.8 MiB |
Length
| Max length | 18 |
|---|---|
| Median length | 12 |
| Mean length | 8.2757441 |
| Min length | 5 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Ahmedabad |
|---|---|
| 2nd row | Ahmedabad |
| 3rd row | Ahmedabad |
| 4th row | Ahmedabad |
| 5th row | Ahmedabad |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Ahmedabad | 2009 | 6.8% |
| Bengaluru | 2009 | 6.8% |
| Chennai | 2009 | 6.8% |
| Mumbai | 2009 | 6.8% |
| Lucknow | 2009 | 6.8% |
| Delhi | 2009 | 6.8% |
| Hyderabad | 2006 | 6.8% |
| Patna | 1858 | 6.3% |
| Gurugram | 1679 | 5.7% |
| Visakhapatnam | 1462 | 5.0% |
| Other values (16) | 10472 | 35.5% |
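The "Other values" row aggregates every city outside the ten listed, so its count and share follow directly from the table. A minimal sketch using the counts above and the 29531 observations reported in the overview:

```python
# Counts copied from the Common Values table for City.
total_rows = 29531
top_counts = {
    "Ahmedabad": 2009, "Bengaluru": 2009, "Chennai": 2009, "Mumbai": 2009,
    "Lucknow": 2009, "Delhi": 2009, "Hyderabad": 2006, "Patna": 1858,
    "Gurugram": 1679, "Visakhapatnam": 1462,
}

# Whatever is not in the top ten falls into "Other values".
other_count = total_rows - sum(top_counts.values())
other_pct = 100 * other_count / total_rows

print(f"Other values: {other_count} rows ({other_pct:.1f}%)")
```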
Words
| Value | Count | Frequency (%) |
|---|---|---|
| ahmedabad | 2009 | 6.8% |
| bengaluru | 2009 | 6.8% |
| chennai | 2009 | 6.8% |
| mumbai | 2009 | 6.8% |
| lucknow | 2009 | 6.8% |
| delhi | 2009 | 6.8% |
| hyderabad | 2006 | 6.8% |
| patna | 1858 | 6.3% |
| gurugram | 1679 | 5.7% |
| visakhapatnam | 1462 | 5.0% |
| Other values (16) | 10472 | 35.5% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| a | 46303 | 18.9% |
| r | 21033 | 8.6% |
| u | 15396 | 6.3% |
| n | 15294 | 6.3% |
| h | 13678 | 5.6% |
| i | 13664 | 5.6% |
| e | 11353 | 4.6% |
| m | 10991 | 4.5% |
| d | 8334 | 3.4% |
| t | 8306 | 3.4% |
| Other values (28) | 80039 | 32.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 244391 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 244391 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 244391 |
Date
Date
| Distinct | 2009 |
|---|---|
| Distinct (%) | 6.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 230.8 KiB |

| Minimum | 2015-01-01 00:00:00 |
|---|---|
| Maximum | 2020-07-01 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
PM2.5
Real number (ℝ)
High correlation 
| Distinct | 11716 |
|---|---|
| Distinct (%) | 39.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 64.510857 |

| Minimum | 0.04 |
|---|---|
| Maximum | 949.99 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 230.8 KiB |
Quantile statistics
| Minimum | 0.04 |
|---|---|
| 5-th percentile | 14.14 |
| Q1 | 32.15 |
| median | 48.57 |
| Q3 | 72.45 |
| 95-th percentile | 180.215 |
| Maximum | 949.99 |
| Range | 949.95 |
| Interquartile range (IQR) | 40.3 |
Descriptive statistics
| Standard deviation | 59.807551 |
|---|---|
| Coefficient of variation (CV) | 0.92709281 |
| Kurtosis | 25.559898 |
| Mean | 64.510857 |
| Median Absolute Deviation (MAD) | 18.75 |
| Skewness | 3.7383702 |
| Sum | 1905070.1 |
| Variance | 3576.9432 |
| Monotonicity | Not monotonic |
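Several of the PM2.5 figures above are derived from others: the range is maximum minus minimum, the IQR is Q3 minus Q1, the coefficient of variation is the standard deviation divided by the mean, and the variance is the standard deviation squared. A short sketch that recomputes them from the reported values (small differences are expected because the inputs are already rounded):

```python
import math

# Values copied from the PM2.5 tables above.
minimum, maximum = 0.04, 949.99
q1, q3 = 32.15, 72.45
mean, std = 64.510857, 59.807551

value_range = maximum - minimum   # reported: 949.95
iqr = q3 - q1                     # reported: 40.3
cv = std / mean                   # reported: 0.92709281
variance = std ** 2               # reported: 3576.9432

# Compare against the reported figures with tolerances that allow for rounding.
assert math.isclose(value_range, 949.95, abs_tol=1e-9)
assert math.isclose(iqr, 40.3, abs_tol=1e-9)
assert math.isclose(cv, 0.92709281, abs_tol=1e-6)
assert math.isclose(variance, 3576.9432, abs_tol=1e-3)
```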
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 48.57 | 4600 | 15.6% |
| 11 | 19 | 0.1% |
| 20.75 | 12 | < 0.1% |
| 27.82 | 11 | < 0.1% |
| 29.75 | 10 | < 0.1% |
| 11.81 | 10 | < 0.1% |
| 18.81 | 10 | < 0.1% |
| 28.45 | 10 | < 0.1% |
| 47.43 | 10 | < 0.1% |
| 15 | 10 | < 0.1% |
| Other values (11706) | 24829 | 84.1% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.04 | 1 | < 0.1% |
| 0.16 | 1 | < 0.1% |
| 0.24 | 1 | < 0.1% |
| 0.28 | 1 | < 0.1% |
| 0.98 | 1 | < 0.1% |
| 0.99 | 1 | < 0.1% |
| 1.14 | 1 | < 0.1% |
| 1.19 | 1 | < 0.1% |
| 1.25 | 1 | < 0.1% |
| 1.39 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 949.99 | 1 | < 0.1% |
| 917.77 | 1 | < 0.1% |
| 916.67 | 1 | < 0.1% |
| 914.94 | 1 | < 0.1% |
| 914.64 | 1 | < 0.1% |
| 894.75 | 1 | < 0.1% |
| 868.66 | 1 | < 0.1% |
| 858.73 | 1 | < 0.1% |
| 832.8 | 1 | < 0.1% |
| 821.42 | 1 | < 0.1% |
PM10
Real number (ℝ)
High correlation 
| Distinct | 12571 |
|---|---|
| Distinct (%) | 42.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 109.65937 |

| Minimum | 0.01 |
|---|---|
| Maximum | 1000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 230.8 KiB |
Quantile statistics
| Minimum | 0.01 |
|---|---|
| 5-th percentile | 31.605 |
| Q1 | 79.315 |
| median | 95.68 |
| Q3 | 111.88 |
| 95-th percentile | 255.335 |
| Maximum | 1000 |
| Range | 999.99 |
| Interquartile range (IQR) | 32.565 |
Descriptive statistics
| Standard deviation | 72.32402 |
|---|---|
| Coefficient of variation (CV) | 0.65953345 |
| Kurtosis | 13.209428 |
| Mean | 109.65937 |
| Median Absolute Deviation (MAD) | 16.27 |
| Skewness | 2.8554438 |
| Sum | 3238350.7 |
| Variance | 5230.7639 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 95.68 | 11142 | 37.7% |
| 94 | 9 | < 0.1% |
| 33.81 | 7 | < 0.1% |
| 20.53 | 6 | < 0.1% |
| 43.1 | 6 | < 0.1% |
| 109.67 | 6 | < 0.1% |
| 72.04 | 6 | < 0.1% |
| 39.46 | 6 | < 0.1% |
| 102.17 | 6 | < 0.1% |
| 84.08 | 6 | < 0.1% |
| Other values (12561) | 18331 | 62.1% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.01 | 1 | < 0.1% |
| 0.02 | 1 | < 0.1% |
| 0.03 | 1 | < 0.1% |
| 0.04 | 2 | < 0.1% |
| 0.06 | 1 | < 0.1% |
| 0.07 | 1 | < 0.1% |
| 0.13 | 2 | < 0.1% |
| 0.14 | 2 | < 0.1% |
| 0.16 | 1 | < 0.1% |
| 0.17 | 2 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1000 | 1 | < 0.1% |
| 985 | 2 | < 0.1% |
| 917.08 | 1 | < 0.1% |
| 847.41 | 1 | < 0.1% |
| 802.87 | 1 | < 0.1% |
| 796.88 | 1 | < 0.1% |
| 768.16 | 1 | < 0.1% |
| 763.58 | 1 | < 0.1% |
| 761.91 | 1 | < 0.1% |
| 743.98 | 1 | < 0.1% |
NO
Real number (ℝ)
High correlation 
| Distinct | 5776 |
|---|---|
| Distinct (%) | 19.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 16.642601 |

| Minimum | 0.02 |
|---|---|
| Maximum | 390.68 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 230.8 KiB |
Quantile statistics
| Minimum | 0.02 |
|---|---|
| 5-th percentile | 1.88 |
| Q1 | 6.21 |
| median | 9.89 |
| Q3 | 17.57 |
| 95-th percentile | 57.255 |
| Maximum | 390.68 |
| Range | 390.66 |
| Interquartile range (IQR) | 11.36 |
Descriptive statistics
| Standard deviation | 21.506064 |
|---|---|
| Coefficient of variation (CV) | 1.2922298 |
| Kurtosis | 28.90099 |
| Mean | 16.642601 |
| Median Absolute Deviation (MAD) | 4.67 |
| Skewness | 4.1827986 |
| Sum | 491472.64 |
| Variance | 462.51078 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 9.89 | 3592 | 12.2% |
| 5.93 | 34 | 0.1% |
| 7.78 | 29 | 0.1% |
| 8.78 | 29 | 0.1% |
| 0.92 | 28 | 0.1% |
| 0.97 | 27 | 0.1% |
| 1.94 | 27 | 0.1% |
| 0.9 | 26 | 0.1% |
| 2.89 | 26 | 0.1% |
| 7.97 | 26 | 0.1% |
| Other values (5766) | 25687 | 87.0% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.02 | 7 | < 0.1% |
| 0.03 | 3 | < 0.1% |
| 0.06 | 2 | < 0.1% |
| 0.09 | 2 | < 0.1% |
| 0.1 | 1 | < 0.1% |
| 0.11 | 2 | < 0.1% |
| 0.12 | 1 | < 0.1% |
| 0.13 | 1 | < 0.1% |
| 0.14 | 1 | < 0.1% |
| 0.18 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 390.68 | 1 | < 0.1% |
| 382.44 | 1 | < 0.1% |
| 351.3 | 1 | < 0.1% |
| 304.26 | 1 | < 0.1% |
| 289.75 | 1 | < 0.1% |
| 288.55 | 1 | < 0.1% |
| 287.14 | 1 | < 0.1% |
| 273.39 | 1 | < 0.1% |
| 270.09 | 1 | < 0.1% |
| 268.03 | 1 | < 0.1% |
NO2
Real number (ℝ)
High correlation 
| Distinct | 7404 |
|---|---|
| Distinct (%) | 25.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 27.726576 |

| Minimum | 0.01 |
|---|---|
| Maximum | 362.21 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 230.8 KiB |
Quantile statistics
| Minimum | 0.01 |
|---|---|
| 5-th percentile | 5.4 |
| Q1 | 12.98 |
| median | 21.69 |
| Q3 | 34.665 |
| 95-th percentile | 70.83 |
| Maximum | 362.21 |
| Range | 362.2 |
| Interquartile range (IQR) | 21.685 |
Descriptive statistics
| Standard deviation | 23.050531 |
|---|---|
| Coefficient of variation (CV) | 0.83135152 |
| Kurtosis | 13.252875 |
| Mean | 27.726576 |
| Median Absolute Deviation (MAD) | 10.04 |
| Skewness | 2.697408 |
| Sum | 818793.51 |
| Variance | 531.32698 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 21.69 | 3590 | 12.2% |
| 10.58 | 24 | 0.1% |
| 9.42 | 23 | 0.1% |
| 9.14 | 18 | 0.1% |
| 10.21 | 17 | 0.1% |
| 7.14 | 17 | 0.1% |
| 9.44 | 17 | 0.1% |
| 9.47 | 17 | 0.1% |
| 9.24 | 17 | 0.1% |
| 10.09 | 17 | 0.1% |
| Other values (7394) | 25774 | 87.3% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.01 | 2 | < 0.1% |
| 0.02 | 5 | < 0.1% |
| 0.03 | 9 | < 0.1% |
| 0.04 | 2 | < 0.1% |
| 0.05 | 3 | < 0.1% |
| 0.06 | 3 | < 0.1% |
| 0.07 | 7 | < 0.1% |
| 0.08 | 5 | < 0.1% |
| 0.09 | 7 | < 0.1% |
| 0.1 | 4 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 362.21 | 1 | < 0.1% |
| 292.02 | 1 | < 0.1% |
| 277.31 | 1 | < 0.1% |
| 273.39 | 1 | < 0.1% |
| 266.46 | 1 | < 0.1% |
| 245.62 | 1 | < 0.1% |
| 241.34 | 1 | < 0.1% |
| 239.18 | 1 | < 0.1% |
| 239.1 | 1 | < 0.1% |
| 237.27 | 1 | < 0.1% |
NOx
Real number (ℝ)
High correlation  Zeros 
| Distinct | 8156 |
|---|---|
| Distinct (%) | 27.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 31.063568 |

| Minimum | 0 |
|---|---|
| Maximum | 467.63 |
| Zeros | 740 |
| Zeros (%) | 2.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 230.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 3.41 |
| Q1 | 14.67 |
| median | 23.52 |
| Q3 | 36.015 |
| 95-th percentile | 90.13 |
| Maximum | 467.63 |
| Range | 467.63 |
| Interquartile range (IQR) | 21.345 |
Descriptive statistics
| Standard deviation | 29.477748 |
|---|---|
| Coefficient of variation (CV) | 0.94894919 |
| Kurtosis | 13.246348 |
| Mean | 31.063568 |
| Median Absolute Deviation (MAD) | 10.14 |
| Skewness | 2.8521721 |
| Sum | 917338.24 |
| Variance | 868.93764 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 23.52 | 4193 | 14.2% |
| 0 | 740 | 2.5% |
| 4.22 | 208 | 0.7% |
| 6.24 | 115 | 0.4% |
| 4.3 | 35 | 0.1% |
| 2.21 | 31 | 0.1% |
| 4.95 | 19 | 0.1% |
| 4.14 | 18 | 0.1% |
| 4.47 | 17 | 0.1% |
| 4.97 | 16 | 0.1% |
| Other values (8146) | 24139 | 81.7% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 740 | 2.5% |
| 0.03 | 4 | < 0.1% |
| 0.04 | 9 | < 0.1% |
| 0.05 | 3 | < 0.1% |
| 0.06 | 2 | < 0.1% |
| 0.07 | 2 | < 0.1% |
| 0.09 | 1 | < 0.1% |
| 0.1 | 3 | < 0.1% |
| 0.11 | 2 | < 0.1% |
| 0.12 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 467.63 | 1 | < 0.1% |
| 382.84 | 1 | < 0.1% |
| 378.31 | 1 | < 0.1% |
| 378.24 | 1 | < 0.1% |
| 302.78 | 1 | < 0.1% |
| 293.1 | 1 | < 0.1% |
| 289.09 | 1 | < 0.1% |
| 287.89 | 1 | < 0.1% |
| 273.33 | 1 | < 0.1% |
| 271.94 | 1 | < 0.1% |
NH3
Real number (ℝ)
| Distinct | 5922 |
|---|---|
| Distinct (%) | 20.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 20.813789 |

| Minimum | 0.01 |
|---|---|
| Maximum | 352.89 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 230.8 KiB |
Quantile statistics
| Minimum | 0.01 |
|---|---|
| 5-th percentile | 3.66 |
| Q1 | 12.04 |
| median | 15.85 |
| Q3 | 21.755 |
| 95-th percentile | 53.64 |
| Maximum | 352.89 |
| Range | 352.88 |
| Interquartile range (IQR) | 9.715 |
Descriptive statistics
| Standard deviation | 21.028862 |
|---|---|
| Coefficient of variation (CV) | 1.0103332 |
| Kurtosis | 44.355841 |
| Mean | 20.813789 |
| Median Absolute Deviation (MAD) | 4.37 |
| Skewness | 5.2046589 |
| Sum | 614651.99 |
| Variance | 442.21305 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 15.85 | 10332 | 35.0% |
| 6.29 | 36 | 0.1% |
| 6.32 | 29 | 0.1% |
| 6.31 | 28 | 0.1% |
| 6.3 | 28 | 0.1% |
| 6.28 | 27 | 0.1% |
| 6.27 | 24 | 0.1% |
| 10.46 | 23 | 0.1% |
| 6.59 | 22 | 0.1% |
| 6.6 | 21 | 0.1% |
| Other values (5912) | 18961 | 64.2% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.01 | 2 | < 0.1% |
| 0.02 | 6 | < 0.1% |
| 0.04 | 1 | < 0.1% |
| 0.05 | 1 | < 0.1% |
| 0.06 | 1 | < 0.1% |
| 0.08 | 2 | < 0.1% |
| 0.1 | 1 | < 0.1% |
| 0.11 | 4 | < 0.1% |
| 0.12 | 3 | < 0.1% |
| 0.13 | 2 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 352.89 | 1 | < 0.1% |
| 328.89 | 1 | < 0.1% |
| 323.48 | 1 | < 0.1% |
| 309.04 | 1 | < 0.1% |
| 303.53 | 1 | < 0.1% |
| 302.08 | 1 | < 0.1% |
| 301.28 | 1 | < 0.1% |
| 301.18 | 1 | < 0.1% |
| 297.64 | 1 | < 0.1% |
| 296.43 | 1 | < 0.1% |
CO
Real number (ℝ)
Zeros 
| Distinct | 1779 |
|---|---|
| Distinct (%) | 6.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.1538722 |

| Minimum | 0 |
|---|---|
| Maximum | 175.81 |
| Zeros | 2328 |
| Zeros (%) | 7.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 230.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0.54 |
| median | 0.89 |
| Q3 | 1.38 |
| 95-th percentile | 7.155 |
| Maximum | 175.81 |
| Range | 175.81 |
| Interquartile range (IQR) | 0.84 |
Descriptive statistics
| Standard deviation | 6.7246605 |
|---|---|
| Coefficient of variation (CV) | 3.122126 |
| Kurtosis | 117.79595 |
| Mean | 2.1538722 |
| Median Absolute Deviation (MAD) | 0.4 |
| Skewness | 9.2101442 |
| Sum | 63606 |
| Variance | 45.221058 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 2328 | 7.9% |
| 0.89 | 2262 | 7.7% |
| 0.68 | 209 | 0.7% |
| 0.85 | 208 | 0.7% |
| 0.8 | 205 | 0.7% |
| 0.84 | 200 | 0.7% |
| 0.78 | 200 | 0.7% |
| 0.81 | 199 | 0.7% |
| 0.64 | 198 | 0.7% |
| 0.67 | 194 | 0.7% |
| Other values (1769) | 23328 | 79.0% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 2328 | 7.9% |
| 0.01 | 59 | 0.2% |
| 0.02 | 59 | 0.2% |
| 0.03 | 56 | 0.2% |
| 0.04 | 30 | 0.1% |
| 0.05 | 48 | 0.2% |
| 0.06 | 42 | 0.1% |
| 0.07 | 40 | 0.1% |
| 0.08 | 34 | 0.1% |
| 0.09 | 38 | 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 175.81 | 1 | < 0.1% |
| 145.32 | 1 | < 0.1% |
| 134.85 | 1 | < 0.1% |
| 132.47 | 1 | < 0.1% |
| 132.07 | 1 | < 0.1% |
| 124.01 | 1 | < 0.1% |
| 119.68 | 1 | < 0.1% |
| 119.3 | 1 | < 0.1% |
| 118.02 | 1 | < 0.1% |
| 118 | 1 | < 0.1% |
SO2
Real number (ℝ)
| Distinct | 4761 |
|---|---|
| Distinct (%) | 16.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13.830897 |

| Minimum | 0.01 |
|---|---|
| Maximum | 193.86 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 230.8 KiB |
Quantile statistics
| Minimum | 0.01 |
|---|---|
| 5-th percentile | 2.8 |
| Q1 | 6.09 |
| median | 9.16 |
| Q3 | 13.81 |
| 95-th percentile | 43.165 |
| Maximum | 193.86 |
| Range | 193.85 |
| Interquartile range (IQR) | 7.72 |
Descriptive statistics
| Standard deviation | 17.005647 |
|---|---|
| Coefficient of variation (CV) | 1.2295404 |
| Kurtosis | 25.900578 |
| Mean | 13.830897 |
| Median Absolute Deviation (MAD) | 3.51 |
| Skewness | 4.4248517 |
| Sum | 408440.22 |
| Variance | 289.19203 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 9.16 | 3864 | 13.1% |
| 5.74 | 36 | 0.1% |
| 6.12 | 35 | 0.1% |
| 4.65 | 32 | 0.1% |
| 5.81 | 32 | 0.1% |
| 5.53 | 32 | 0.1% |
| 6.61 | 32 | 0.1% |
| 5.95 | 31 | 0.1% |
| 5.57 | 31 | 0.1% |
| 6.47 | 31 | 0.1% |
| Other values (4751) | 25375 | 85.9% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.01 | 1 | < 0.1% |
| 0.04 | 1 | < 0.1% |
| 0.21 | 1 | < 0.1% |
| 0.26 | 1 | < 0.1% |
| 0.36 | 1 | < 0.1% |
| 0.41 | 2 | < 0.1% |
| 0.42 | 1 | < 0.1% |
| 0.44 | 1 | < 0.1% |
| 0.48 | 1 | < 0.1% |
| 0.49 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 193.86 | 1 | < 0.1% |
| 187.02 | 1 | < 0.1% |
| 186.08 | 1 | < 0.1% |
| 182.39 | 1 | < 0.1% |
| 180.85 | 1 | < 0.1% |
| 179.18 | 1 | < 0.1% |
| 178.93 | 1 | < 0.1% |
| 178.63 | 1 | < 0.1% |
| 178.58 | 1 | < 0.1% |
| 176.88 | 1 | < 0.1% |
O3
Real number (ℝ)
| Distinct | 7699 |
|---|---|
| Distinct (%) | 26.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 33.994121 |

| Minimum | 0.01 |
|---|---|
| Maximum | 257.73 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 230.8 KiB |
Quantile statistics
| Minimum | 0.01 |
|---|---|
| 5-th percentile | 7.67 |
| Q1 | 20.74 |
| median | 30.84 |
| Q3 | 42.73 |
| 95-th percentile | 71.78 |
| Maximum | 257.73 |
| Range | 257.72 |
| Interquartile range (IQR) | 21.99 |
Descriptive statistics
| Standard deviation | 20.202304 |
|---|---|
| Coefficient of variation (CV) | 0.59428817 |
| Kurtosis | 4.5298247 |
| Mean | 33.994121 |
| Median Absolute Deviation (MAD) | 10.92 |
| Skewness | 1.4959537 |
| Sum | 1003880.4 |
| Variance | 408.13307 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 30.84 | 4030 | 13.6% |
| 16.48 | 17 | 0.1% |
| 22.14 | 15 | 0.1% |
| 23.6 | 15 | 0.1% |
| 18.33 | 14 | < 0.1% |
| 19.64 | 14 | < 0.1% |
| 13.14 | 13 | < 0.1% |
| 22.94 | 13 | < 0.1% |
| 19.68 | 13 | < 0.1% |
| 32.06 | 13 | < 0.1% |
| Other values (7689) | 25374 | 85.9% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.01 | 4 | < 0.1% |
| 0.02 | 7 | < 0.1% |
| 0.03 | 2 | < 0.1% |
| 0.04 | 3 | < 0.1% |
| 0.05 | 2 | < 0.1% |
| 0.06 | 3 | < 0.1% |
| 0.07 | 1 | < 0.1% |
| 0.1 | 8 | < 0.1% |
| 0.11 | 2 | < 0.1% |
| 0.12 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 257.73 | 1 | < 0.1% |
| 200.41 | 1 | < 0.1% |
| 193.31 | 1 | < 0.1% |
| 186.07 | 1 | < 0.1% |
| 177.07 | 1 | < 0.1% |
| 175.04 | 1 | < 0.1% |
| 172.28 | 1 | < 0.1% |
| 169.36 | 1 | < 0.1% |
| 169.35 | 1 | < 0.1% |
| 165.48 | 1 | < 0.1% |
Benzene
Real number (ℝ)
High correlation  Skewed  Zeros 
| Distinct | 1873 |
|---|---|
| Distinct (%) | 6.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.859874 |

| Minimum | 0 |
|---|---|
| Maximum | 455.03 |
| Zeros | 3802 |
| Zeros (%) | 12.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 230.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0.24 |
| median | 1.07 |
| Q3 | 2.42 |
| 95-th percentile | 8.23 |
| Maximum | 455.03 |
| Range | 455.03 |
| Interquartile range (IQR) | 2.18 |
Descriptive statistics
| Standard deviation | 14.252822 |
|---|---|
| Coefficient of variation (CV) | 4.9837235 |
| Kurtosis | 653.45478 |
| Mean | 2.859874 |
| Median Absolute Deviation (MAD) | 0.93 |
| Skewness | 23.63338 |
| Sum | 84454.94 |
| Variance | 203.14292 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 1.07 | 5667 | 19.2% |
| 0 | 3802 | 12.9% |
| 0.03 | 300 | 1.0% |
| 0.02 | 292 | 1.0% |
| 0.01 | 217 | 0.7% |
| 0.04 | 190 | 0.6% |
| 0.05 | 176 | 0.6% |
| 2 | 170 | 0.6% |
| 0.09 | 170 | 0.6% |
| 0.1 | 167 | 0.6% |
| Other values (1863) | 18380 | 62.2% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 3802 | 12.9% |
| 0.01 | 217 | 0.7% |
| 0.02 | 292 | 1.0% |
| 0.03 | 300 | 1.0% |
| 0.04 | 190 | 0.6% |
| 0.05 | 176 | 0.6% |
| 0.06 | 146 | 0.5% |
| 0.07 | 123 | 0.4% |
| 0.08 | 157 | 0.5% |
| 0.09 | 170 | 0.6% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 455.03 | 1 | < 0.1% |
| 454.85 | 1 | < 0.1% |
| 449.38 | 1 | < 0.1% |
| 448.59 | 1 | < 0.1% |
| 445.83 | 1 | < 0.1% |
| 443.63 | 1 | < 0.1% |
| 438.01 | 1 | < 0.1% |
| 435.9 | 1 | < 0.1% |
| 435.09 | 1 | < 0.1% |
| 432.94 | 1 | < 0.1% |
Toluene
Real number (ℝ)
High correlation  Zeros 
| Distinct | 3608 |
|---|---|
| Distinct (%) | 12.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.1404849 |

| Minimum | 0 |
|---|---|
| Maximum | 454.85 |
| Zeros | 2861 |
| Zeros (%) | 9.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 230.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1.28 |
| median | 2.97 |
| Q3 | 6.02 |
| 95-th percentile | 31.46 |
| Maximum | 454.85 |
| Range | 454.85 |
| Interquartile range (IQR) | 4.74 |
Descriptive statistics
| Standard deviation | 17.224737 |
|---|---|
| Coefficient of variation (CV) | 2.4122644 |
| Kurtosis | 290.69125 |
| Mean | 7.1404849 |
| Median Absolute Deviation (MAD) | 2.03 |
| Skewness | 13.490402 |
| Sum | 210865.66 |
| Variance | 296.69157 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 2.97 | 8058 | 27.3% |
| 0 | 2861 | 9.7% |
| 0.02 | 111 | 0.4% |
| 0.03 | 102 | 0.3% |
| 0.05 | 99 | 0.3% |
| 0.04 | 86 | 0.3% |
| 1.1 | 83 | 0.3% |
| 6 | 79 | 0.3% |
| 0.08 | 76 | 0.3% |
| 0.06 | 72 | 0.2% |
| Other values (3598) | 17904 | 60.6% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 2861 | 9.7% |
| 0.01 | 70 | 0.2% |
| 0.02 | 111 | 0.4% |
| 0.03 | 102 | 0.3% |
| 0.04 | 86 | 0.3% |
| 0.05 | 99 | 0.3% |
| 0.06 | 72 | 0.2% |
| 0.07 | 61 | 0.2% |
| 0.08 | 76 | 0.3% |
| 0.09 | 54 | 0.2% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 454.85 | 1 | < 0.1% |
| 454.12 | 1 | < 0.1% |
| 449.14 | 1 | < 0.1% |
| 448.87 | 1 | < 0.1% |
| 445.84 | 1 | < 0.1% |
| 443.63 | 1 | < 0.1% |
| 437.77 | 1 | < 0.1% |
| 435.94 | 1 | < 0.1% |
| 434.92 | 1 | < 0.1% |
| 433.02 | 1 | < 0.1% |
AQI
Real number (ℝ)
High correlation 
| Distinct | 829 |
|---|---|
| Distinct (%) | 2.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 158.78155 |

| Minimum | 13 |
|---|---|
| Maximum | 2049 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 230.8 KiB |
Quantile statistics
| Minimum | 13 |
|---|---|
| 5-th percentile | 52 |
| Q1 | 88 |
| median | 118 |
| Q3 | 179 |
| 95-th percentile | 391 |
| Maximum | 2049 |
| Range | 2036 |
| Interquartile range (IQR) | 91 |
Descriptive statistics
| Standard deviation | 130.27241 |
|---|---|
| Coefficient of variation (CV) | 0.82045056 |
| Kurtosis | 25.833384 |
| Mean | 158.78155 |
| Median Absolute Deviation (MAD) | 38 |
| Skewness | 3.7697515 |
| Sum | 4688978 |
| Variance | 16970.902 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 118 | 4829 | 16.4% |
| 102 | 223 | 0.8% |
| 100 | 222 | 0.8% |
| 70 | 208 | 0.7% |
| 106 | 208 | 0.7% |
| 78 | 198 | 0.7% |
| 98 | 195 | 0.7% |
| 104 | 192 | 0.7% |
| 66 | 192 | 0.7% |
| 80 | 190 | 0.6% |
| Other values (819) | 22874 | 77.5% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 13 | 1 | < 0.1% |
| 14 | 3 | < 0.1% |
| 15 | 3 | < 0.1% |
| 16 | 4 | < 0.1% |
| 17 | 7 | < 0.1% |
| 18 | 2 | < 0.1% |
| 19 | 27 | 0.1% |
| 20 | 29 | 0.1% |
| 21 | 7 | < 0.1% |
| 22 | 8 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 2049 | 1 | < 0.1% |
| 1917 | 1 | < 0.1% |
| 1842 | 1 | < 0.1% |
| 1747 | 1 | < 0.1% |
| 1719 | 1 | < 0.1% |
| 1672 | 1 | < 0.1% |
| 1646 | 1 | < 0.1% |
| 1630 | 1 | < 0.1% |
| 1613 | 1 | < 0.1% |
| 1595 | 1 | < 0.1% |
AQI_Bucket
Categorical
High correlation 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.8 MiB |
Length
| Max length | 12 |
|---|---|
| Median length | 9 |
| Mean length | 8.5441401 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Moderate |
|---|---|
| 2nd row | Moderate |
| 3rd row | Moderate |
| 4th row | Moderate |
| 5th row | Moderate |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Moderate | 13510 | 45.7% |
| Satisfactory | 8224 | 27.8% |
| Poor | 2781 | 9.4% |
| Very Poor | 2337 | 7.9% |
| Good | 1341 | 4.5% |
| Severe | 1338 | 4.5% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| moderate | 13510 | 42.4% |
| satisfactory | 8224 | 25.8% |
| poor | 5118 | 16.1% |
| very | 2337 | 7.3% |
| good | 1341 | 4.2% |
| severe | 1338 | 4.2% |
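The table above counts lowercase words rather than whole labels, which is why "poor" appears 5118 times: it occurs in both the Poor (2781) and Very Poor (2337) buckets. A minimal sketch of that tally, assuming words are obtained by lowercasing each label and splitting on whitespace (the report's exact tokenizer is not shown here):

```python
from collections import Counter

# Row counts per label, copied from the AQI_Bucket Common Values table.
bucket_counts = {
    "Moderate": 13510, "Satisfactory": 8224, "Poor": 2781,
    "Very Poor": 2337, "Good": 1341, "Severe": 1338,
}

# Assumed tokenization: lowercase, then split on whitespace.
word_counts = Counter()
for label, count in bucket_counts.items():
    for word in label.lower().split():
        word_counts[word] += count

total_words = sum(word_counts.values())
for word, count in word_counts.most_common():
    print(f"{word:<12} {count:>6} {100 * count / total_words:5.1f}%")
```

The percentages use the total word count (31868) as the denominator rather than the number of rows, which explains why they differ from the label-level frequencies.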
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| o | 34652 | 13.7% |
| e | 33371 | 13.2% |
| r | 30527 | 12.1% |
| a | 29958 | 11.9% |
| t | 29958 | 11.9% |
| d | 14851 | 5.9% |
| M | 13510 | 5.4% |
| y | 10561 | 4.2% |
| S | 9562 | 3.8% |
| i | 8224 | 3.3% |
| Other values (8) | 37143 | 14.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 252317 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 252317 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 252317 |
Interactions
Correlations
| | AQI | AQI_Bucket | Benzene | CO | City | NH3 | NO | NO2 | NOx | O3 | PM10 | PM2.5 | SO2 | Toluene |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| AQI | 1.000 | 0.578 | 0.215 | 0.465 | 0.208 | 0.275 | 0.428 | 0.424 | 0.423 | 0.267 | 0.675 | 0.813 | 0.349 | 0.283 |
| AQI_Bucket | 0.578 | 1.000 | 0.023 | 0.260 | 0.350 | 0.091 | 0.182 | 0.235 | 0.197 | 0.144 | 0.317 | 0.420 | 0.195 | 0.123 |
| Benzene | 0.215 | 0.023 | 1.000 | 0.242 | 0.099 | 0.089 | 0.218 | 0.280 | 0.237 | 0.137 | 0.210 | 0.190 | 0.148 | 0.710 |
| CO | 0.465 | 0.260 | 0.242 | 1.000 | 0.174 | 0.166 | 0.317 | 0.243 | 0.306 | 0.064 | 0.229 | 0.328 | 0.223 | 0.315 |
| City | 0.208 | 0.350 | 0.099 | 0.174 | 1.000 | 0.216 | 0.136 | 0.177 | 0.171 | 0.163 | 0.220 | 0.163 | 0.218 | 0.129 |
| NH3 | 0.275 | 0.091 | 0.089 | 0.166 | 0.216 | 1.000 | 0.273 | 0.389 | 0.231 | 0.169 | 0.293 | 0.294 | 0.067 | 0.069 |
| NO | 0.428 | 0.182 | 0.218 | 0.317 | 0.136 | 0.273 | 1.000 | 0.466 | 0.702 | -0.057 | 0.396 | 0.397 | 0.330 | 0.184 |
| NO2 | 0.424 | 0.235 | 0.280 | 0.243 | 0.177 | 0.389 | 0.466 | 1.000 | 0.588 | 0.295 | 0.405 | 0.427 | 0.226 | 0.324 |
| NOx | 0.423 | 0.197 | 0.237 | 0.306 | 0.171 | 0.231 | 0.702 | 0.588 | 1.000 | 0.037 | 0.401 | 0.392 | 0.316 | 0.257 |
| O3 | 0.267 | 0.144 | 0.137 | 0.064 | 0.163 | 0.169 | -0.057 | 0.295 | 0.037 | 1.000 | 0.226 | 0.260 | 0.187 | 0.195 |
| PM10 | 0.675 | 0.317 | 0.210 | 0.229 | 0.220 | 0.293 | 0.396 | 0.405 | 0.401 | 0.226 | 1.000 | 0.684 | 0.302 | 0.235 |
| PM2.5 | 0.813 | 0.420 | 0.190 | 0.328 | 0.163 | 0.294 | 0.397 | 0.427 | 0.392 | 0.260 | 0.684 | 1.000 | 0.262 | 0.226 |
| SO2 | 0.349 | 0.195 | 0.148 | 0.223 | 0.218 | 0.067 | 0.330 | 0.226 | 0.316 | 0.187 | 0.302 | 0.262 | 1.000 | 0.239 |
| Toluene | 0.283 | 0.123 | 0.710 | 0.315 | 0.129 | 0.069 | 0.184 | 0.324 | 0.257 | 0.195 | 0.235 | 0.226 | 0.239 | 1.000 |
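The "High correlation" alerts in the overview are consistent with flagging every pair whose coefficient in this matrix is at least 0.5 in absolute value. The sketch below applies that rule to a subset of the pairs; the 0.5 threshold is an assumption inferred from the alerts, not a setting stated in this report.

```python
from collections import defaultdict

THRESHOLD = 0.5  # assumption: inferred from the alerts, not confirmed by the report

# A subset of coefficients copied from the correlation matrix above.
pairs = {
    ("AQI", "AQI_Bucket"): 0.578,
    ("AQI", "PM10"): 0.675,
    ("AQI", "PM2.5"): 0.813,
    ("Benzene", "Toluene"): 0.710,
    ("NO", "NOx"): 0.702,
    ("NO2", "NOx"): 0.588,
    ("PM10", "PM2.5"): 0.684,
    ("NO", "NO2"): 0.466,    # below the threshold, so not flagged
    ("AQI", "CO"): 0.465,    # below the threshold, so not flagged
}

# Collect, for each variable, the partners it is highly correlated with.
partners = defaultdict(set)
for (a, b), r in pairs.items():
    if abs(r) >= THRESHOLD:
        partners[a].add(b)
        partners[b].add(a)

for name in sorted(partners):
    print(f"{name}: {', '.join(sorted(partners[name]))}")
```

Under this assumption AQI is flagged with three partners (AQI_Bucket, PM10, PM2.5) and NOx with two (NO, NO2), matching the alert list.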
Missing values
Sample
First rows
| | City | Date | PM2.5 | PM10 | NO | NO2 | NOx | NH3 | CO | SO2 | O3 | Benzene | Toluene | AQI | AQI_Bucket |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | Ahmedabad | 1/1/2015 | 48.57 | 95.68 | 0.92 | 18.22 | 17.15 | 15.85 | 0.92 | 27.64 | 133.36 | 0.00 | 0.02 | 118.0 | Moderate |
| 1 | Ahmedabad | 1/2/2015 | 48.57 | 95.68 | 0.97 | 15.69 | 16.46 | 15.85 | 0.97 | 24.55 | 34.06 | 3.68 | 5.50 | 118.0 | Moderate |
| 2 | Ahmedabad | 1/3/2015 | 48.57 | 95.68 | 17.40 | 19.30 | 29.70 | 15.85 | 17.40 | 29.07 | 30.70 | 6.80 | 16.40 | 118.0 | Moderate |
| 3 | Ahmedabad | 1/4/2015 | 48.57 | 95.68 | 1.70 | 18.48 | 17.97 | 15.85 | 1.70 | 18.59 | 36.08 | 4.43 | 10.14 | 118.0 | Moderate |
| 4 | Ahmedabad | 1/5/2015 | 48.57 | 95.68 | 22.10 | 21.42 | 37.76 | 15.85 | 22.10 | 39.33 | 39.31 | 7.01 | 18.89 | 118.0 | Moderate |
| 5 | Ahmedabad | 1/6/2015 | 48.57 | 95.68 | 45.41 | 38.48 | 81.50 | 15.85 | 45.41 | 45.76 | 46.51 | 5.42 | 10.83 | 118.0 | Moderate |
| 6 | Ahmedabad | 1/7/2015 | 48.57 | 95.68 | 112.16 | 40.62 | 130.77 | 15.85 | 112.16 | 32.28 | 33.47 | 0.00 | 0.00 | 118.0 | Moderate |
| 7 | Ahmedabad | 1/8/2015 | 48.57 | 95.68 | 80.87 | 36.74 | 96.75 | 15.85 | 80.87 | 38.54 | 31.89 | 0.00 | 0.00 | 118.0 | Moderate |
| 8 | Ahmedabad | 1/9/2015 | 48.57 | 95.68 | 29.16 | 31.00 | 48.00 | 15.85 | 29.16 | 58.68 | 25.75 | 0.00 | 0.00 | 118.0 | Moderate |
| 9 | Ahmedabad | 1/10/2015 | 48.57 | 95.68 | 9.89 | 7.04 | 0.00 | 15.85 | 0.89 | 8.29 | 4.55 | 0.00 | 0.00 | 118.0 | Moderate |
Last rows
| | City | Date | PM2.5 | PM10 | NO | NO2 | NOx | NH3 | CO | SO2 | O3 | Benzene | Toluene | AQI | AQI_Bucket |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 29521 | Visakhapatnam | 6/22/2020 | 33.17 | 108.22 | 5.58 | 42.45 | 27.06 | 13.70 | 0.73 | 13.65 | 34.85 | 3.99 | 10.24 | 95.0 | Satisfactory |
| 29522 | Visakhapatnam | 6/23/2020 | 25.40 | 83.38 | 2.76 | 34.09 | 19.92 | 13.13 | 0.54 | 10.40 | 43.27 | 2.88 | 12.03 | 100.0 | Satisfactory |
| 29523 | Visakhapatnam | 6/24/2020 | 34.36 | 90.90 | 1.22 | 23.38 | 13.12 | 14.45 | 0.56 | 10.92 | 35.12 | 2.99 | 3.15 | 86.0 | Satisfactory |
| 29524 | Visakhapatnam | 6/25/2020 | 13.45 | 58.54 | 2.30 | 21.60 | 13.09 | 12.27 | 0.41 | 8.19 | 29.38 | 1.28 | 5.64 | 77.0 | Satisfactory |
| 29525 | Visakhapatnam | 6/26/2020 | 7.63 | 32.27 | 5.91 | 23.27 | 17.19 | 11.15 | 0.46 | 6.87 | 19.90 | 1.45 | 5.37 | 47.0 | Good |
| 29526 | Visakhapatnam | 6/27/2020 | 15.02 | 50.94 | 7.68 | 25.06 | 19.54 | 12.47 | 0.47 | 8.55 | 23.30 | 2.24 | 12.07 | 41.0 | Good |
| 29527 | Visakhapatnam | 6/28/2020 | 24.38 | 74.09 | 3.42 | 26.06 | 16.53 | 11.99 | 0.52 | 12.72 | 30.14 | 0.74 | 2.21 | 70.0 | Satisfactory |
| 29528 | Visakhapatnam | 6/29/2020 | 22.91 | 65.73 | 3.45 | 29.53 | 18.33 | 10.71 | 0.48 | 8.42 | 30.96 | 0.01 | 0.01 | 68.0 | Satisfactory |
| 29529 | Visakhapatnam | 6/30/2020 | 16.64 | 49.97 | 4.05 | 29.26 | 18.80 | 10.03 | 0.52 | 9.84 | 28.30 | 0.00 | 0.00 | 54.0 | Satisfactory |
| 29530 | Visakhapatnam | 7/1/2020 | 15.00 | 66.00 | 0.40 | 26.85 | 14.05 | 5.20 | 0.59 | 2.10 | 17.05 | 1.07 | 2.97 | 50.0 | Good |